Objective Speech Quality Estimation of In-Ear Microphone Speech
نویسندگان
چکیده
Speech captured from an in-ear microphone (IEM) under an intra-aural device is beneficial in extremely noisy environments as it maintains a relatively high signal to noise ratio. Due to its limited bandwidth, speech enhancement is required in order to obtain a more natural speech. Consequently, quick and practical measurement of speech quality is important. In this paper, we compare the performance of the quality of intrusive and non-intrusive objective quality metrics on IEM speech, and propose an adaptation of a non-intrusive metric, the speech-toreverberation modulation energy ratio (SRMR) to IEM speech signals. Changes are implemented to take into account the effect of the occluded ear on the recorded speech signals, which causes an amplification in the bone conduction sounds in the ear canal. We show that the updated SRMR metric, SRMRIEM, significantly reduces the performance gap between nonintrusive and intrusive metrics.
منابع مشابه
شکلدهی وفقی و هوشمند پرتو در آرایههای میکروفونی Ad-hoc با استفاده از خوشهبندی و رتبهبندی میکروفونها
Considering the existence of a many speech degradation factors, speech enhancement has become an important topic in the field of speech processing. Beamforming is one of the well-known methods for improving the speech quality that is conventionally applied using regular (classical) microphone arrays. Due to the restrictions in the regular arrangement of microphones, in recent years there has be...
متن کاملIn-Ear Microphone Equalization Exploiting an Active Noise Control
A pair of ear-muffs that employs active noise control (ANC) for noise reduction, substantially reduces the influence of the low frequencies inside the cap. This implies an indirect high-pass filtering of the sound in the external auditory canal (EAC). This paper shows that the above mentioned high-pass filtering property is convenient when combining an ANC headset with an in-ear microphone (ear...
متن کاملDistributed multichannel speech enhancement with minimum mean-square error short-time spectral amplitude, log-spectral amplitude, and spectral phase estimation
In this paper, the authors present optimal multichannel frequency domain estimators for minimum mean-square error (MMSE) short-time spectral amplitude (STSA), log-spectral amplitude (LSA), and spectral phase estimation in a widely distributed microphone configuration. The estimators utilize Rayleigh and Gaussian statistical models for the speech prior and noise likelihood with a diffuse noise f...
متن کاملAdaptive Time Domain Signal Estimation for Multi-Microphone Speech Enhancement
In this paper, the main ideas of a new method for multi-microphone speech enhancement are presented. Combining previously known multichannel speech enhancement methods with Maximum A Posteriori (MAP) estimation concepts, and temporally whitenning the output noises, we obtain a new algorithm called Adaptive Time-domain Signal Estimation for MultiMicrophone (ATSEM). The simulations (using real-wo...
متن کاملA generalized estimation approach for linear and nonlinear microphone array post-filters
This paper presents a robust and general method for estimating the transfer functions of microphone array post-filters, derived under various speech enhancement criteria. For the case of the mean square error (MSE) criterion, the proposed method is an improvement of the existing McCowan post-filter, which under the assumption of a known noise field coherence function uses the autoand cross-spec...
متن کامل